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1. An apparatus for showing the results of a query- 
performed on a data base by a data base system, the apparatus 

including: .■ 

display generation means coupled to the data base system 
for receiving the results and generating a display therefrom; 
and 

display production ? means % /: coupled to the . display 
generation means for receiving ; and t producing the .-generated 

display; 

said apparatus comprising: 

a n-dimensional grid of at least two-dimensions wherein 
each dimension represents a set of Attributable values, 

a set of first areas in ^the ^isplay, each .first area 
corresponding to a subquery which Jdef ines an intersection of 
sets of attribute values speci f ied in jthe query; and 

a symbol of varying appearance J which appears in an area 
of the first areas, the varying -appearance operating to 
indicate a characteristic of the results of the subquery 
corresponding to the area. .^k0$&^.- 

-: :: :SS^Mi AVAltABLE G0PY 



Displaying Query Results 



1 Background of the Invention 

1.1 Field of the Invention 

The invention concerns graphical displays in computer systems in general 
and graphical displays of the results of queries in particular. 

1.2 Description of the Prior Art 

As computers have become cheaper and more powerful, interactive data base 
systems have become widely available. Users in many different areas of en- 
deavor employ interactive data base systems to find items of information in a 
large collection of information. A user of an interactive data base system pro- 
vides the data base system with a query which the user believes will describe 
the items of information being sought. The data base system then responds 
to the query by returning a result which includes all of the information m the 
data base system which satisfies the query. The result generally also includes 
a value which indicates the number of hits, that is, the number of items of 
data which satisfy the query. . ■ • IAI 

FIG. 1 is a block diagram of an example interactive data base system 101. 
The system includes terminal 103, which provides output to and receives 
input from the user, processor 113, xvhich performs the actual searching op- 
crations. and memory system 1 15, which contains programs 119 executed by 
processor 113 and data base 117 which contains the data. In more detail, 
terminal 103 includes a display screen 105, upon which processor 113 displays 
information for the user. Display screen 105 also includes pointer 107, which 
specifies a location in display 105 and may be moved under control of e. her 
keyboard 109 or mouse 1 1 1. The user controls the operation of system 101 by 
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inputs from keyboard 109 and/or mouse 111. Processor 113 may be any kind 
of processor, from a personal computer through a supercomputer. Memory 
system 1 15, finally, includes any data accessible to system 101, and may thus 
include random-access memory, a file system on magnetic or optical disk, 
or even rcmotcly-iocated data bases. When employing system 101 to find 
information in data base 117, the user inputs a query using keyboard 109 or 
mouse 111: processor 113 executes programs 119 as required to perform the 
query on data base 117 and returns the number of hits and the results to 
display screen 105. The user can then use keyboard 109 and/or mouse 111 
to examine the results in more detail. 

The usefulness of interactive data base systems like system 101 is attested 
to by their popularity; present systems do, however, have their limitations. 
A recurring source of frustration among users of interactive data base sys- 
tems is that the systems always seem to return cither too little or too much 
information. A user is typically looking for a half a dozen good pieces of 
information: queries typically produce no hits or a hundred; when the query 
produces no hits, the user must broaden it, whereupon it produces hundreds; 
when it produces too many, the user must narrow it, whereupon it produces 
no hits. Very often, even experienced users are at a loss to find a query which 
will produce a useful number of hits. --lv^-' * 

The problems caused by large numbers of hits are aggravated by the fact 
that interactive data base systems typically only specify the number of hits, 
and thus provide no information which the user can take advantage of to 
subdivide the collection of hits. This tendency to treat the hits as a "black 
bag" also makes it difficult to use interactive data base systems to investigate 
questions like the relationship of the hits to a period of time or to a set of 
authors. It is an object of the invention to solve these and other problems of 
interactive data base systems. 
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The invention solves the foregoing problcm^J>yjnjyi»i«>f^^ on display 
screen 105 which uses spaces^jlisphir^cTc^n 105 to represent subqucries 
and symboUoj^yAtWlypcTand «izcs in the spaces for the subqueries to 
je^»efl<Tira^h^racteristic of the hits for the subquery. In a preferred crnbod- 
imtmt. th* dn«« b«ing gimrmd U a d a ta b a sa o f ct a tisti csl litw a tww and - 



SUMMARY OF THE INVENTION v - -^i^Hp-- ■ 

According to one aspect "of |t he present invention, there 
is disclosed an apparatus for J showing the results of a query 
performed on a data base by a :d^gase; system, the apparatus 
including: ''■<"■:■:'--■(.■. 

display generation means coupled to the data base system 
for receiving the results and generating a display therefrom; 

and * -'....^./v • 

display production means ^ coupled to the display 
generation means for receivingHand ; producing the generated 
display; '^r •*.«•*; ••, -rv 

said apparatus comprising: ) ; 

a n-dimensional grid ofl atlleast two-dimensions wherein 
each dimension represents a set of attributable values, 

a set of first areas in;; the display, each first area 
corresponding to a subquery which defines an intersection of 
sets of attribute values specified in the query; and 

a symbol of varying appearance which appears in an area 
of the first areas, the varying | appearance operating to 
indicate a characteristic of the results of the subquery 
corresponding to the area. / ^^.V 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG 1. is a block diagram of an interactive data base 
system; 

FIG. 2 is a picture of the display in a preferred 
embodiment; 

FIG. 3 is « picture of a grid which is displaying numbers 

of pages; . 

FIG. 4 is a picture of a .grid which is displaying the 

results of a search using an author name; ^.v:-:^,.:.;^.:-^:;.-"..--- ••■ 

FIG. 5 is an overview of ^a ^process 505 which produces 

display 201; -v . '*■•'.< ^ r l|f--'- 

FIG. 6 is a detail of data base files 515; 
FIG. 7 is a detail of search result data 519; 
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PIG. 8 is a detail of display 

FIG. 9 is a diagram showing production of display 20 
Reference numbers ' in the Drawing have two parts the .two, ^ 
least-significant digits are the number; of fan item in a figure; • 
the remaining digits are the number of ; the figure in which the 
item first appears. Thus^an^te^^ ; the reference, number 
201 first appears in FIG. 2. '> \**^V : 

.DETAILED DESCRIPTION OP A^P^FE^g^ODIMENT^ 

The following Detailed De^iipti^will first present a 
description of a preferred en^diment, 4 | ^^^^S^ 
a user and will then describe/^e implementation ^ . the 
preferred embodiment in detail. _ 
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4.1 Environment of the Preferred Embodiment 

The preferred embodiment runs under the UNIX operating system using 
the Xll graphics system with the Motif widget set. (UNIX is a registered 
trademark of Unix Systems Laboratories j- jThe system upon which the pre- 
ferred embodiment executes is a Silicon Graphics Indigo workstation. Speed 
is achieved by using an inverted index, several layers of caching, and memory 
mapped 10. By carefully caching the indices, the system can resolve many 
keyword or author searches in a fraction of a second. Memory mapping each 
of the index files increases speed by saving 10 overhead. Each search is cached 
after it completes, so that executing it again will be nearly instantaneous. 
Screen updates are double buffered using off-screen X pixmaps. ^ v , 

The data base to which the preferred embodiment is applied is the 1992 
Current Index of Statistics (CIS) database. This data base is a reference data 
base for statistics which contains 128,000 references to statistical literature 
published during the last twenty years/: .There is an entry for each article 
or book. The entry contains the following information about the article or 
book: 

• a list of the authors of the article; 



• the title; 

• the journal; 

• the volume of the journal; 

• the beginning page of the article; 

• the end page of the article; 

• the year the article appeared; and gi 

• a list of keywords describing the contents of the article. 

The data base contains neither citation cross references nor the text of 
the articles. 




4.2 The Display: FIG. 2 

FIG. 2 shows display 201 on display screen 105 of the results of a query on 
data base U7 (the CIS data base) in a preferred embodiment. The query 
was a search on the keyword bandits. In the preferred embodiment, display 
201 is a window in display screen 105. At the center of display 201 is grid 
213. At the left of grid 213 are journal names 209, a list of the names of 
those journals in the CIS data base which contain articles for which bandtts 
is listed as a keyword. At the top is a list of years 211 from 1975 through 
1991. Both the names and the years arc displayed in white. If the search 
results in at least some minimum number of hits for a journal in a year (i.e, 
if the journal published at least that number of articles whose entries m the 
data base have the keyword being searched on), a symbol is displayed at 
the point where a horizontal line extending from the journal name meets a 
vertical line extending from the year. The symbol thus represents the results 
of a subqucry of the data base which has been limited to the journal and year 
represented by the symbol's location. The symbols shape, size, and color all 
indicate information about the hits. In display 201 , the minimum number of 
hits is 1: a small square 215 indicates a single hit; circles 217 indicate more 
than one hit, with the size of the circle being proportional to the number ol 
hits. Colors also indicate the number of hits: when the symbol represents a 
single hit, it is white; when it represents two hits, it is blue; when it represents 
three, it is yellow, and when it represents more than three hits it is red. 

As is immediately apparent from FIG. 2, grid 213 shows the user imme- 
diately what journals arc important for the subject he or she is interested in 
and the activity in the subject area over time. Further information is pro- 
vided by bar plot 221 at the bottom of grid 213 and bar plot 223 at the right 
hand side of the grid. Dar plot 221 is a plot of the number of hits (that is. 
articles on the search subject) by year; bar plot 223 is a plot of the number 
of hits by journal. Again, the user immediately sees what journals and what 
years arc important for the subject matter he or she is interested in. 



4.3 Interactive Operation of Display 201 

Display 201 is controlled by means of a pointing device such as mouse 111. 
In a preferred embodiment, mouse 111 has three Nitons; however, only the 
leftmost button is involved in the user interaction with display 201. When 



pointer 107 (not shown in FIG. 2) controlled by mouse 1 1 1 moves over a sym- 
bol 215 or 217, the year and journal for the symbol and the lines connecting 
the symbol with the year and the journal arc displayed in green and the 
journal and year of publication appear in green below grid 213; when pointer 
107 moves over a symbol with the leftmost button depressed, information 
231 from the data base entries represented by the hits appears in article view 
window 229. which is implemented as a Motif scrollable text window. The 
user can then use mouse 111 to scroll through the information represented 
by the selected symbol. 

Further control is by means of buttons and sliders around the margins 
of display 201. The buttons and sliders are operated by means of mouse 
111. Buttons 206 specify the modes of operation: searching, in which the 
data base is searched using a keyword, displaying, in which the results of the 
search are displayed in grid 213, and quit, which terminates execution of the 
program of programs 119 which produces display 201. Hit type selector 203 
consists of two radio buttons, one for articles and the other for pages. When 
the button for articles has been pushed, the symbols in grid 213 indicate 
numbers of articles; when the button for pages has been pushed, they indicate 
the number of pages in the hits represented by the symbol. Sliders 207 
and 205 control grid 213. Slider 207 proportionally scales and shrinks all 
symbols, and thereby lets the user deal with overlaps of symbols; slider 205 
establishes a threshold for the display of symbols; a symbol is displayed only 
if the number of hits is greater than the threshold. Field 225 in display 201 
shows the ratio of hits being displayed to the total hits. In Fig. 2, slider 205 
specifies a threshold of 1, and consequently, all hits are being displayed. 

A search is specified as follows: when the search button in buttons 206 
is indicated, a choices-popup window 227 appears at the lower right-hand 
corner of display 201. Window 207 includes a field for specifying an author 
name or keyword and a Do It button. The user moves the pointer to the field 
and then employs keyboard 109 to input the author name or keyword; when 
that's done, the user employs mouse 1 1 1 to press the Do It button to perform 
the search using the name or keyword specified in the field. When display 201 
is placed in the display mode, the results of the search arc displayed, rhere 
may be more than one window 207; in display 201, there are two. In the 
preferred embodiment, the encaching of the query results makes redrawing 
of grid 213 almost instantaneous; consequently, the results of several searches 
can be compared by placing the display in display mode and pressing the Do 



It buttons for the searches in rapid succession. 



4.4 Other examples of Grid 213 

FIG. 3 shows grid 301 in which the symbol shape represents the number of 
hits as before, but the symbol size represents the number of pages in the 
hits. Thus, a large box such as 303 represents a single reference with a large 
number of pages, or in other words, a book on the subject being searched. 
In a preferred embodiment, the symbol size is a function of the relationship 
between the number of pages in the hits for the symbol to the maximum 
number of pages in the hits for any symbol. Fig. 4 shows how display 201 
can be used to research the places and times of publication by an author on 
a subject. Grid 401 is a result of a search on the author name Bradley Efron. 
It shows that according to the 1992 version of the CIS database Efron has 
published 93 articles. The two most frequent locations of Efron 's articles arc 
J ASA and the Annals of Statistics. Other likely locations arc The American 
Statistician, The Annals of Mathematical Statistics, and Biomelrica. The 
page display (not shown) indicates that he wrote one book, in 1980. One 
particularly interesting feature of grid 401 is that it clearly shows that Efron 
has a low publication year every five years. 

4.5 Other Uses of Display 201 

Of course, displays built on the principles of display 201 can be used in many 
other areas. One example would be a display used with a legal data base. 
In such a display, each field in the grid could represent a court-year pair; the 
display could then be used to discover which courts had been particularly 
active in developing a legal doctrine specified in a search term. The display 
would of course also serve to show how the doctrine came into and fell out of 
favor over a period of time. A display in which the grid represented company- 
year pairs could be used with a patent data base to investigate the activities 
of the companies in various areas of technology represented by the search 
terms. A display in which the grid represented company- technology pairs 
could be used with the patent data base to investigate foreign patent effort 
of the companies for the various technologies. 

Of course, the use of two limiting factors to define the subquery is par- 
ticularly advantageous when the symbols arc displayed in a grid, but the 



invention is not limited to the use of two. For ^example a 
display would permit the use of subqueries defined by three limiting factory 
the symbols would then appear as three-dimensional bodies in a space dc- 
nn Jby the three limiting factors. Additional limiting actors may also be 
introduced which determine the appearance of the symbols, rather than their 
locations in the display. ',. '. 

4.6 Detailed Implementation ? ^ 

In the following, We will present details of the implementation of a preferred 
SnwSl. banning with an overview of the method used to pjrfi*. 
disolav 201. continuing with a discussion of the data structures '^ohed, 
£d eluding with a" discussion of how the data structures are used to 
produce the display. 

4.6.1 Overview of the Method 

In a preferred embodiment, the first step in producing display 201is an 
nitialization step in which the files making up 

Zppcd into memory and a hash table is constructed. 1 he hash tabic > permU 
L quick determination of the locations of all hits on a keyword. The first 
im a user specifies a search on a specific keyword, the keyword ,s used wUh 
the hash tab e to construct a hit list of the locations ,n the search data base 
of the hits The hit list is then used with the search data base to construct 
Inlay daia structures used to generate the display, and the display ,s then 
generated from those data structures. The hit list is retained in memory -until 
fhe chtic "popup window 227 specifying the search which prod uced the , hit 
Z Is dosed, and consequently, once a search has been done, its results may 
be quickly redisplayed. 

4.6.2 Overview of a Process producing a Display 201: FIG. 5 
In a preferred embodiment, display 201 on terminal 103 is f P^uced by a 
Xcrrunning on processor 1 13 and responding to «»P^^ 
min*l 103 FIG 5 Rives an overview of that process. Process 505 has address 
Zee 503 by mca^f which process 505 may reference data in memory 
s^tem 115 Address space 503 includes display code 507, which ,s the code 
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executed by process 503 when producing display 201 and interacting with 
users of terminal 103 upon which display 201 is being displayed. Important 
components of display code 507 arc initialization code 509. which initializes 
the structures used to perform the search, search code 511, which performs 
5 the search, and display drawing code 513, which produces display 201 on 

terminal 103. . . 

The data used by process 505 falls into two categories: data which is 
used by all searches and displays and data of which there is a copy for each 
search and for each display of a search. The data which belongs to the first 

10 category is memory mapped data base files 515 and hash tabic 516. Memory 

mapped data base files 515 arc the files containing the data. These data 
base files 515 have been directly mapped into process 505's address space 
503, so that process 505 can access the contents of the data base files 515 by 
means of relatively rapid memory references rather than much slower file I/O 

15 operations. Hash table 516 contains an entry for each key in the indexes in 

data base files 515. The contents of the entry is a list of the locations of the 
first occurrence of that key in the actual data files of the data base. When 
given a kev. the hash table quickly produces the list of locations for the key. 
Pcr-search data 517 includes search result data 519 and display data 521. 

20 The most important part of search result data 519 is a list of the locations 

of all of the hits on the search in data base files 515. As mentioned above, 
search result data 519 is retained as long as there is a window 227 specifying 
the search. Display data 521 is the data structures which process 505 uses 
when executing display portion 513 of code 507 to produce display 201 on 

25 terminal 103. 

4.6.3 Details of Memory Mapped Data Base Files 515: FIG. 6 

FIG. 6 shows the details of memory mapped data base files 515. There are 
three kinds of files: a data base file (DHfilc) 623 for each year of the database, 
30 which contains the actual data, an index file (IxFile) 657 for each data base 

file 623. which contains a list by order of index key of the locations < of each 
occurrence of the item specified by the index key in data base file 623, and 
INDEX file 641, which contains a list of hits for all of the data base files 
623. In the preferred embodiment, the data base files 623 arc Provided by 
the data base provider, along with a software tool for making an Ixfilc 65 1 
for each data base file 623 and an INDEX file 611 for the entire data base. 



Continuing in more detail with data base files 623, each data base file 
623 is identified by a code and by a name: The name is the year for which 
the data base file 623 contains entries. There is a data base entry 625 in the 
file for each article published during the year in the journals reported by the 
data base. Entry 625 contains the following fields: • 

• J YEAR 626, which contains the year of the journal in which the item 
appeared; 

• JTITLE 627. which contains the title of the journal in which the article 
appeared; ■ y'v :■ 

• J VOL 629, which contains the volume number of the journal; 

• PSTART 631 . which contains the starting page of the article; 

• PEND 633, which contains the ending page of the article; 

• TITLE 635. which contains the article'? title; 

• Authors 637. which is a list of the arUdcs authors; and 

• Keywords 639, which is a list of the keywords which describe the art i- 
cle's content. : 

IxFile 657 is simply a list of pointers 659 to the locations of hits in tho data 
base file 623 to which IxFile 657 corresponds^! _ 

In the preferred embodiment, data base files 623 and IxFilcs 6oi are 
located for memory mapping by means of data base file list 601 . which has a 
data base file list entry 603 for each data base file 623. The contents of entry 
603 arc 

• file code 605, an alphanumeric code identifying the data base file 623: 

• database name 607, the name of the data base contained in file 623; in 
this case, the name is the year to which the data base entries in the file 
belong; ;|;lrv ' 

. data base file name (DBFNAME) 609. which is the actual file name of 
data base file 623; 
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• data base file size 611. which »» ' he size of data base file 623 in bytes; 

• index file name (IXFNAME) 613. which is the name of IxFilc 657 
associated with the data base file 623: and 

• index file size 619. which is th- size oi the associated IxFile 657. 

A static variable 621 contains the number of entries in data base file list 601. 

Continuing with index file 641, index file 641 contains a search key entry 
(SKE) 643 for each search key used to search data base file 623. Each search 
key entry 643 contains search key 647 and a data b?se file entry 645 for each 
data base file 623 in which there is an occurrence (or "hit") of search key 
647. Data base file entry 645 for a data base file contains file code 649 for 
the file, number of hits 651. which contains the number of hits for search key 
647 in the file, and first hit location 653, which specifies the byte in the file 
at which the first hit appears. ^ ,. mpY 

As part of initialization, data base files 623, IxFiles 6oi, and INDEX ^file 
641 are memory mapped: further, hash table 516 is made from index file 641. 
Each entry in hash table 516 contains a single search key entry 643 from 
index file 641. and the search key entry 643 for a search key may be accessed 
by providing the search key to hash table 516. 

4.6.4 Details of Search Result Data 519: FIG. 7 
Search result data 519 consists of hit list 701 and number of hits 705. Hit 
list 701 is a list of hits resulting from a single search. The elements or the 
list are line pointers 703 to data base entries 625 in data base .file, i «M 1 winch 
contain the keyword or author for which the search was made. MinSito 
variable which specifies the number of elements presently in hit Ust iOI. 



is a 



4.6.5 Details of Display Data 521: FIG. S 

Display data 521 contains the information needed to draw display 201 for a 
given search. First, there is journal list 801, which is a list of the names 803 of 
all of the journals for which the given search resulted in hits. - */<w^ A » 
805 indicates the number of journal, in journal list 801. JUT LIST SO, has 
an entry *09 corresponding to each journal name in journal list 801. journal 
hits 809 contains the total number of hits produced by the search for the 
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journal represented by the entry 809. YHIT LIST 811 is the equivalent for 
years; there is an entry 813 for each year contained in the data base, and the 
entry 813 contains the total number of hits produced by the search for that 

There are five static variables which control aspects of display 201. The 
variable scale size 823 is set by slider 207, and indicates the size to which 
symbols 217 are to be scaled; threshold 825 is set by slider 205, and indicates 
the number of hits which are required for a symbol 215 or 217 to appear in 
display 201. Display .mode 827 indicates whether hit numbers or total page 
numbers are to be displayed, and is set by the radio buttons in hit type 
selector 203. Maxhits 829 is the largest number of hits in any element 817 
of number of hits grid 815, and is computed as grid 815's elements are set; 
Maxpages 831 is the equivalent value for number of pages grid 819. As will 
be described in more detail later, these values arc used with scale size 823 to 
compete the sizes of symbols 215 or 217. : 

Number of hits grid 815 and number of pages grid 819 contain the infor- 
mation used to produce symbols in grid 213. Grid 815 is a two-dimensional 
array. There is an entry 817 for each journal-year pair, and the entry con- 
tains the number of hits produced by the search for that journal-year pair. 
Number of p?gcs grid 819 is a similar data structure, except that entry 818 
for each journal-year pair contains the number of pages in the hits produced 
by the search for that journal-year pair. Hit pointer grid 821 is also similar. 
There, entry 822 for each journal-year pair contains a pointer to an array 
of pointers to the data base entries 625 for the hits. As will be explained 
in more detail later, hit pointer grid 821 is used to produce the display in 
article view window 229. 

4.6.6 Operation of the Implementation: FIG. 9 

FIG. 9 i« a diagram 901 showing how routines in display code 607 and the 
files and data structures just describe interact to produce display 201. By the 
time a search begins, data base files 515 have been mapped into address space 
503 and hash table 516 has been produced from INDEX file 641. Routines 
belonging to search code 511 appear in FIG. 9 in the portion labelled 511; 
those belonging to display code 513 appear in the portion labelled 513. 

As previously indicated, a user specifies a search by typing the term to be 
searched for into choices-popup window 207 and then pushing the Do it 



button in the window. The term which was typed into the window appears 
in FIG. 9 as search key (SKEY) 902. The routine hsearch applies search key 
902 to hash table 516 and returns search key entry 643 for search key 902 from 
hash table 516. Search key entry (SKE) 643 of course contains information 
permitting location of all of the hits for the search key in the data base. 
Search key entry 643 is provided to the routine resolve, which reads each 
data base file entry 645 in turn. For each hit indicated in search key entry 
643 resolve makes an entry in hit list 701 which contains a pointer to data 
base cntrv 625 for the hit. If there is only one hit indicated by the entry 645, 
resolve uses first hit loc. 653 to derive the pointer; if there is more 
than one, resolve uses IxFile 657 for the data base file 623 corresponding 
to the data base file entry 645 to obtain the remaining locations, and derives 
the pointers for hit list 657 from those locations. 

Display data 521 is constructed by the routine cisjournals906. cisjournals 
906 obtains the information for display data 621 from data base files 603 with 
the help of parse.cis.line 907, which takes a pointer to a data base en- 
try 625 and returns the fields in the data base entry 625. cisjournals 906 
first constructs journal list 801. It does this by going through hit list (01 
and providing each pointer in the list *o parse.cis .line 907, as shown by 
arrow 913. parse.cis Jine 907 uses the pointer to locate data base entry 
625 and return it to cisjournals 906. cisjournals 906 makes a tree from 
the journal titles 627 from the returned data base entries 625 which has one 
node for each unique journal title in the data base entries 625 returned by 
parse.cisJ.ine 907. After cisjournals 906 has worked through hit list 
701, it copies the journal ises in the tree into the journal name fields 803 of 
journal list 801. Then it sorts journal list 801 so that the titles are in alpha- 
betical order. The next step is to initialize all entries indicat ing^ number of 
hits, number of pages, and pointers to hits in arrays 807, 811, 815, 819, and 
821 to 0. Then cisjournals 906 puts values in arrays 807, 811, 817, eia, 
and 821 by again working thorough hit list 701 and providing each pointer 
to parse.ciaJ.ine 907, which again extracts data base entry 625 specified 

by the pointer. n/ . m . . . 

For each data b*»e entry 625 returned by parse.cisJ.ine 90 1, cis j journals 
906 first takes the title in JTITLE 627 and uses it to find the index in journal 
list 801 of that title in journal list 801. Then JIIITS 809 in the corresponding 
entry of JHITLIST 807 is incremented. Next, cisjournals 906 takes the 
value of J YEAR 
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626 and uses it to find the entry in YHITLIST 811 for the year and 
increments that entry 813. Then the routine takes the values of P START 
631 and PEND 633. and uses them to compute the number of pages specified 
by the hit. That value is then added to the value presently in number of pages 
element 818 of number of pages grid 819 for the journal and year specified in 
database entrv 625. Then the value in number of hits element 817 in number 
of hits grid 815 is incremented. Finally, the line pointer 703 in hit list 701 
which was used to obtain data base entry 625 is placed in hit pointer list 822 
in the entrv in hit pointer grid 821 for the journal and the year. 

Thus, after cis journals 906 has worked through hit list 701 the second 
time, journal list 801 contains a sorted list of journal names, JHITLIST 807 
contains the total number of hits for each journal, YHITLIST 811 contains 
the total number of hits for each year, no. of hits grid 815 contains the 
number of hits for each journal-year pair, no. of pages grid 819 contains the 
total number of pages in the hits for each journal-year pair, and hit pointer 
grid 821 contains the pointers to the data base entries 625 for each hit for 
each journal-year pair. 

Display data 521 is used by the callback routine drav_display.call 9U9, 
which is called whenever a user interaction with display 201 requires that the 
entire display be redrawn. The entire display 201 is redrawn in a screen buffer 
in the background, which is then displayed when the redrawing is complete. 
The first step in the redraw operation is drawing journal list 209 and bar plot 
223. The names in journal list 209 come from journal list S01 and the totals 
come from JHITLIST 807. Next, the years in year list 211 and bar plot 221 
are drawn: the totals come from YHITLLIST 811. Then the symbols arc 
drawn in grid 213 for each year-journal pair. The drawing is done . by the 
routine draw symbol 910, which is invoked by drav.display.call 909 if the 
number of hits in element 817 for the year-journal pair is greater than the 
value specified in THRESHOLD 825. . 

draw .symbol 919 is invoked with the indices of the year-journal pair in 
grids 815 and 819 and with an argument indicating whether the redraw is 
part of an entire screen redraw (all indicated by arrow 917). drav-symbol 
919 uses the indices to locate number of hits element 817 and obtains the 
value in the element. Then the routine proceeds as follows: if display mode 
827 indicates that the number of hits is to be displayed, the routine computes 
a symbol size value using the expression scale .siza • number^f -hits / 
maxhits. If display mode 827 indicates that the total number of pages for the 



hits is to determine symbol size, drav_symbol obtains the value of number 
of pages 818 for the article-year pair and computes the symbol size value like 
this: scale-size * nuraber_of .pages / maxpages. 

Then the location of the symbol 215 or 217 in grid 213 is computed and 
the symbol is drawn. If the number of hits in clement 817 is 1, the symbol 
is drawn as a white rectangle having the size specified by the symbol size 
value; if the number of hits is 2, the symbol is drawn as a blue circle having 
the size specified by the symbol size value; otherwise, it is drawn as a red 
circle having the size specified by the symbol size value. When draw symbol 
draws a symbol in response to pointer 107 being positioned on the symbol, 
it proceeds exactlv as set forth above, except that the symbol color is green. 

After having used drav_symbol to draw the symbol corresponding to each 
value in number of hits grid 815 or no. of pages grid 819, draw.display.call 
909 prints field 225, completing those portions of display 201 not contained 
in other windows. 

4.6.7 Responding to Selection of a Symbol 

As pointed out in the description of display 201, when pointer 107 is moved 
onto a svmbol in grid 213, the symbol, the journal in column 209, and the 
year in row 211 are all displayed in green and the name of the journal, the 
year, and the number of hits appear below grid 213. That is done as by 
employing draw_symbol 919 to redraw the symbol in green; then the journal 
and year for the symbol are redrawn in green; finally, the indices of the entry 
in end 815 or 819 are used to locate the journal name in journal list 801 and 
the number of hits for the journal name in JH1TLIST 811. These, and the 
year of the journal (again determined from the indices) are written below 
rid 213 

gn As further pointed out, when pointer 107 is moved onto a symbolin grid 
213 and the leftmost button of mouse 111 is depressed, information 231 for 
the hits represented by the symbol appears in article view window 229. In the 
preferred embodiment, information 231 is obtained by means of hit pointer 
grid 821 and parse.cisJLine 907. Article view window 229 is a windoxv 
into a buflcr which contains the information, and as previously explained, 
article view window 229 may be scrolled through the information. In the 
preferred embodiment, when the leftmost button of mouse 111 is depressed 
while pointer 107 is on a symbol, the pointers in the list of pointers at the 
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entry in hit pointer grid 821 which has the publication and year specified by 
the symbol are provided one at a time to parse.cis.line 907, which returns 
data'base entrv 625 for the hit. The information from entry 625 which is 
to be displayed in article view window 229 is then output to the buffer. 
Thus, bv scrolling article view window 229, the user of the system can see 
that information for each of the hits. In the preferred embodiment, the 
information provided from data base entry 625 includes the author, the title, 
the journal, the pages, the year, and the keywords. Provision of the keywords 
of course permits the user to make other searches using the keywords. 



5 Conclusion 

The foregoing Detailed Description has disclosed a new technique for dis- 
playing the results of queries and has shown how the technique can Reused 
to display a query result as a set of results of subqucrics. The Detailed De- 
scription has further shown how the technique can be used to obtain views 
of information not previously possible and how the displays produced using 
the technique may be interactively controlled. 

While the preferred embodiment disclosed herein is the best presently 
known to the inventors, it will be apparent to those of ordinary skill in 
the art that many other implementations which employ the Peoples of 
the inventions disclosed herein are possible. Further, while grid 213 is a 
particularly advantageous form of display, the principles of the inventions 
disclosed herein apply to any arrangement in which an area of a display is 
related to a subqucry and a symbol indicating the results of the subqucry is 
displayed in the area. Moreover, the uses of the techniques of the invention 
are not limited to those disclosed herein. Rather, the techniques may be used 
In any situation in which subqueries may be related to areas of the screen, 
and the variations in the appearance of the symbols may represent any kind 
of information about the results of the subquery. . 

All of the above being the case, the foregoing Detailed Description is 
to be understood as being in every respect illustrative and exemplary, but 
not restrictive, and the scope of the invention disclosed herein is not to be 
determined from the Detailed Description, but rather from the claims as 
interpreted with the full breadth permitted by the law. 

What Is claimed is: 
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The claims defining the invention! are as follows: 

1. An apparatus for showing the results of a query 
performed on a data base by a data' base system, the apparatus 
including: 

display generation means coupled to the data base system 
for receiving the results and generating a display therefrom; 

and . 

display production means coupled to the display 

generation means for receiving and producing the generated 

display; 

said apparatus comprising: 

a n-dimensional grid of at least two-dimensions wherein 
each dimension represents a set of attributable values, 

a set of first areas in the display, each first area 
corresponding to a subquery which defines an intersection of 
sets of attribute values specified in the query; and 

a symbol of varying appearance which appears in an area 
of the first areas, the varying appearance operating to 
indicate a characteristic of the results of the subquery 
corresponding to the area. 

2. The apparatus set forth in claim 1, wherein: 

the symbol of varying appearance has a shape which varies 
according to the characteristic of the results of the subquery 
corresponding to the area of the first areas. 

3. The apparatus set forth in claim 1 wherein: 

the symbol of varying appearance has a size which varies 
according to the characteristic^ ^the results of the subquery 
corresponding to the area of the first areas. 

4 The apparatus set forth in claim 1 wherein: 
the symbol of varying appearance has a colour which 
varies according to the characteristic of the results of the 
subquery corresponding to the area of the first areas. 

5. The apparatus set forth in claim 1 wherein: 
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the characteristic according to which the appearance 
varies is a number of hits produced by the subquery. 

6. The apparatus set forth in claim 5 wherein: 

the characteristic according to which the appearance 

varies is a number of pages in the number of hits produced by 

the subquery. 
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apparatus 



set forth in claim 1 



further 



7 . The 
comprising: 

means in the display generation means for establishing a 
threshold value for the characteristic, the symbol appearing in 
a given one area of the set of first areas only if the subquery 
corresponding to the given area of the first areas produces a 
result in which the characteristic exceeds the threshold value. 

8. The apparatus set forth in claim 7 wherein the 
means for establishing a threshold value is interactive. 

20 9 . The apparatus set forth in claim 1 further 

comprising: 

means in the display generation means for adjusting the 
size of the symbol in the area o^ the ; first areas . 

25 10 . The apparatus set forth in claim 9 wherein: 

the means for adjusting the size of the symbol is 

interactive . 

11. The apparatus set forth in claim 1 further 

30 comprising; 

means for selecting the symbol; and 

means in the display generation means responsive to 
selection of the symbol by the symbol selecting means for 
generating the display such that , the display shows further 
36 information about the subquery represented by the symbol. 



12. 




The apparatus set forth in claim 11 wherein: 
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the further information "is ^information . about the hits 
produced by the subquery . ■. y^m^.;; .. 

. .•- i ■ +rX ■ -.v ; ''V; 1 : <J ; / ,r,.. ? . . 

13 The " • fnrhh i n ; 

comprising: 

means for interactively specifying the query; and 
means in the display generation' mean to the 

means for interactively specif yin^ the; query for producing the 
set of first areas as required by the, query and the subqueries. 
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there is 



The apparatus set forth Jin claim 13 wherein: 
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specifying the query; and ' W^^t -"' 

the means for producing the -set 'of first areas includes 
means for encaching the results or the subqueries, ;i 

whereby a plurality of sets of ^irst areas corresponding 
to the queries specified by"5 the plurality of means for 
interactively specifying the query, may 'be displayed in rapid 
succession. *|| ||| ; |£; 

15 The apparatus set forth, in claim 1 further 

comprising: & pi® f| - 

a set of second areas in the /display for identifying the 
subqueries, areas in the set of second areas being . spatially 
related to the set of first areas,: such that the subquery to 
which a given area of the set-off first areas belongs can be 
determined by one or more areas Hn _ the set of second areas, 
related thereto. • \ r k- -.<- ;l ; , 

16. The apparatuelT^jr^ 

there is an area of the' second areas correspond! 
each of the dimensions of the 9*"^ . . 

17 The apparatus set. li orth^n%iaim ',ij|^herein : 
th ond - i fsirinaiedaefl sofethe - qi 



to 



18. The apparatus settf o^J^n^i^i^ "l7 ; wherein : 
the number of dimensions is;.2 
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there is a first second area located along a first edge 
of the grid and a second second area located along a second 
edge of the grid which is adjacent to the first edge; and 

the first second area and the second second area relate 
to the set of first areas such that a first line which passes 
through a given one of the first areas and is parallel to the 
first edge and a second line which passes through the given one 
of the first areas and is parallel to the second edge pass 
through portions of the first second areas and the second 
second area which define the subquery for the given one of the 
first areas. 

19. The apparatus set forth ; in claim 16 further 

comprising: 

means for selecting the symbol; and 

means in the display generation means responsive to the 
means for selecting the symbol for changing the appearance of 
the symbol and of second areas which correspond to the area in 
which the symbol appears. 

20 The apparatus set forth . in claim 16 further 
comprising : 

a set of third areas in the display, each area of the set 
of third areas corresponding to a' one-dimensional array of 
first areas for displaying a summation of the characteristics 
of the results represented by the one-dimensional array of 
first areas . 

21. An apparatus for showing the results of a query 
performed on a data base by a data base system substantially as 
hereinbefore described with reference to Figs. 2 to 9 of the 
accompanying drawings. 

DATED this Sixteenth Day of January 1997 

AT&T Corp . ; 

Patent Attorneys for the Applicant 
SPRUSON & FERGUSON 
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Displaying Query Results 
Abstract " " • 

A technique for displaying the results of a query on a database. 
The technique associates a subquery of the query with a space 1n a 

5 display and represents the results of the subquery by means of a symbol 
whose appearance varies with a characteristic of the results of the 
subquery. In a preferred embodiment, the , technique 1s used with queries 
on a data base of statistical articles. The display (201) 1s a grid 
(213) with each field of the grid (213)jrepresent1ng a journal-year pair 

10 (209,211). A symbol (215,217) appears In ; the field 1f a search of the 
data base results In a minimum number of hits 1n the year (211) and 
journal (209) represented by the pair. The appearance of the symbol 
(215, 217) varies with properties such as the number of hits and the 
number of pages 1n the hits. Bar graphs (221, 223) along the sides of 

15 the grid (213) Indicate total numbers of hits per journal (223) and year 
(221). The symbol (215,217) for a given journal -year pair changes Us 
color when the pointer 1s moved over 1t. and 1f a mouse button 1s 
depressed when the pointer 1s over the symbol (215,217), the data base 
references for the hits corresponding to the journal-year pair are 

20 displayed In an article view window (229). Sliders (205,207) control the 
number of hits required for the display of a symbol (205) and the size of 
the symbols (207). The Implementation; employs, memory mapping and caching 
techniques to achieve rapid generation V of -a display (201) from the 
results of a search. 
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